Depth estimation for stereo image pairs
نویسندگان
چکیده
Philips has developed a multi-view autostereoscopic 3D television which is capable of displaying video in real 3D without the use of external glasses or any other devices. Creating 3D content for these screens is difficult because they require not only the video itself but also the depth map as input. Depth maps are computed from a multi-view recording of the video sequence. To extract the depth map we have to solve the so called stereo correspondence problem. We developed a model, which with the help of some additional human input, can solve it reasonably well. We started by casting the original problem into the form of an energy minimization problem. The energy of a depth map consists of a weighted combination of two terms, a data term which indicates how well the depth maps corresponds with the input images, and an interaction term which is a measure of smoothness. The problem is then converted into the maximization of the probability of a Markov Random Field (MRF), a much used concept in the field of Computer Vision. An iterative procedure to obtain an approximate solution is derived from a Varia-tional Mean Field approximation of the MRF. Several extensions are made to the resulting model in order to make it more suitable for the difficulties that arise in practical situations. These include a multi-scale approach, a gradient based automatic boundary detection algorithm, the possibility to incorporate (manually selected) occluded and boundary pixels in the computation of the depth map and the option of using input from three camera views instead of just two. Our implementation of the model allows for several ways of giving manual input in order to obtain improved depth maps, of which the most important is the manual correction of the automatically computed boundaries. Unclassified Conclusions: In this report we have described a model for computing a depth map from a set of stereo images. We started with a mathematically sound model by casting the problem in the form of an energy minimization problem. This minimization problem is not exactly solvable and an iterative procedure was developed in order to obtain an approximate solution. While this solution method gives reasonable results, we made several extensions to the model in order to overcome problems that present themselves in real world example images. An important guideline in adding these extensions was that we wanted our model to accept user input in …
منابع مشابه
Depth Estimation based on Stereo Camera Pairs of Color and Infrared using Cross-based Local Multipoint Filter
This paper proposes a method of acquiring threedimensional shape data about moving objects based on stereo camera pairs of the color and the infrared by projecting infrared dot patterns. Most of the methods that project infrared pattern estimate a single depth map by measuring the translation of a known pattern with an infrared camera. The aim of the proposed method is to make 3D models by inte...
متن کاملSelf-Supervised Siamese Learning on Stereo Image Pairs for Depth Estimation in Robotic Surgery
INTRODUCTION Robotic surgery has become a powerful tool for performing minimally invasive procedures, providing advantages in dexterity, precision, and 3D vision, over traditional surgery. One popular robotic system is the da Vinci surgical platform, which allows preoperative information to be incorporated into live procedures using Augmented Reality (AR). Scene depth estimation is a prerequisi...
متن کاملDirect Estimation of Motion and Extended Scene Structure from a Moving Stereo Rig
We describe a new method for motion estimation and 3D reconstruction from stereo image sequences obtained by a stereo rig moving through a rigid world. We show that given two stereo pairs, one can compute the motion of the stereo rig directly from the image derivatives (spatial and temporal). Correspondences are not required. One can then use the images from both pairs combined, to compute a de...
متن کاملDepth Estimation Analysis Using Sum of Absolute Difference Algorithm
Stereo vision is one of the methods that can yield depth information of the scene. It uses stereo image pairs from two cameras to produce disparity maps that can be easily turn into depth maps. It is assumed that both the left and right channels of the multiview image sequence are coded using block or object-based methods. A dynamic programming algorithm is used to estimate a disparity field be...
متن کاملDepth Estimation using Defocused Stereo Image Pairs
In this paper we propose a new method for estimating depth using a fusion of defocus and stereo, that relaxes the assumption of a pinhole model of the camera. It avoids the correspondence problem of stereo. Main advantage of this algorithm is simultaneous recovery of depth and image restoration. The depth (blur or disparity) in the scene and the intensity process in the focused image are indivi...
متن کاملFusion of stereo vision and Time-Of-Flight imaging for improved 3D estimation
This paper suggests an approach in fusing two 3D estimation techniques: stereo vision and Time-Of-Flight (TOF) imaging. By mapping the TOF-depth measurements to stereo disparities, the correspondence between the images from a fast TOF-camera and standard high resolution camera pair are found, so the TOF depth measurements can be linked to the image pairs. In the same framework, a method is deve...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009